House analysis - Shapash

Interpretation of house predictions

Project_Information

Author : VotreNom

Description : Shapash report for houses

Project_Name : extratrees_maison analysis


Model analysis

Model used : ExtraTreesRegressor

Library : sklearn.ensemble._forest

Library version : 1.5.2

Model parameters :

Parameter key             Parameter value
estimator                 ExtraTreeRegressor()
n_estimators              235
estimator_params          ('criterion', 'max_depth', 'min_samples_split', 'min_samples_leaf', 'min_weight_fraction_leaf', 'max_features', 'max_leaf_nodes', 'min_impurity_decrease', 'random_state', 'ccp_alpha', 'monotonic_cst')
bootstrap                 False
oob_score                 False
n_jobs                    None
random_state              None
verbose                   0
warm_start                False
class_weight              None
max_samples               None
criterion                 squared_error
max_depth                 19
min_samples_split         4
min_samples_leaf          1
min_weight_fraction_leaf  0.0
max_features              1.0
max_leaf_nodes            None
min_impurity_decrease     0.0
ccp_alpha                 0.0
monotonic_cst             None
feature_names_in_         ['etage' 'surface' 'surface_terrain' 'nb_pieces' 'balcon' 'eau' 'bain' 'dpeL' 'dpeC' 'mapCoordonneesLatitude' 'mapCoordonneesLongitude' 'annonce_exclusive' 'nb_etages' 'places_parking' 'cave' 'ges_class' 'annee_construction' 'nb_toilettes' 'ascenseur' 'chauffage_energie' 'chauffage_systeme' 'chauffage_mode'...
n_features_in_            55
_n_samples                11132
n_outputs_                1
_n_samples_bootstrap      None
estimator_                ExtraTreeRegressor()
estimators_               [ExtraTreeRegressor(max_depth=19, min_samples_split=4, random_state=255869882), ExtraTreeRegressor(max_depth=19, min_samples_split=4, random_state=2014308808), ExtraTreeRegressor(max_depth=19, min_samples_split=4, random_state=1849920314), ExtraTreeRegressor(max_depth=19, min_samples_split=4,...
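
In the table above, n_estimators, max_depth and min_samples_split are the only constructor arguments set away from the scikit-learn defaults. The sketch below shows one way an equivalent, unfitted estimator could be rebuilt. It assumes scikit-learn 1.5.x is installed; HYPERPARAMS and build_model are illustrative names that do not appear in the report. Since random_state is None, refitting would not reproduce the exact trees listed under estimators_.

```python
# Hyperparameters copied from the "Model parameters" table.
HYPERPARAMS = {
    "n_estimators": 235,
    "criterion": "squared_error",
    "max_depth": 19,
    "min_samples_split": 4,
    "min_samples_leaf": 1,
    "max_features": 1.0,
    "bootstrap": False,
    "random_state": None,
}


def build_model():
    """Return an unfitted ExtraTreesRegressor configured like the reported model."""
    # Imported lazily so the configuration can be inspected without scikit-learn.
    from sklearn.ensemble import ExtraTreesRegressor

    return ExtraTreesRegressor(**HYPERPARAMS)
```

Calling build_model().fit(X_train, y_train) would then train on whatever training frame is available; X_train and y_train are placeholders here.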

Dataset analysis

Global analysis

                        Training dataset  Prediction dataset
number of features      NaN               55
number of observations  NaN               2,783
missing values          NaN               0
% missing values        NaN               0
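
The Training dataset column is empty (NaN), but the model parameters report _n_samples = 11132. Assuming that attribute counts the rows used for fitting (the report does not say so explicitly), the figures are consistent with a 20% hold-out, as this small arithmetic sketch shows.

```python
# Row counts taken from the report.
n_train = 11_132  # _n_samples from the model parameters (assumed to be training rows)
n_test = 2_783    # observations in the prediction dataset

# Share of all rows that ended up in the prediction (test) dataset.
test_share = n_test / (n_train + n_test)
print(f"test share: {test_share:.3f}")  # test share: 0.200
```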

Univariate analysis

etage - Categorical

Prediction dataset
distinct values 1
missing values 0

Target analysis

prix_m2_vente - Numeric

Prediction dataset
count 2,783
mean 2,710
std 951
min 150
25% 2,100
50% 2,700
75% 3,250
max 8,190

Multivariate analysis


Model explainability

Note : the explainability graphs were generated using the test set only.
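
For orientation, a report of this kind is typically produced from a fitted model and a test set along the lines of the sketch below. It assumes Shapash 2.x; the function build_report, its arguments and the sklearn.metrics paths in REPORT_METRICS are illustrative guesses, not taken from the project's actual code.

```python
# Metrics listed in the report's "Metrics" section, as name/path pairs.
# The sklearn.metrics paths are an assumption about how the report was configured.
REPORT_METRICS = [
    {"name": "MAE", "path": "sklearn.metrics.mean_absolute_error"},
    {"name": "R2", "path": "sklearn.metrics.r2_score"},
    {"name": "MSE", "path": "sklearn.metrics.mean_squared_error"},
    {"name": "MAPE", "path": "sklearn.metrics.mean_absolute_percentage_error"},
    {"name": "MdAE", "path": "sklearn.metrics.median_absolute_error"},
    {"name": "Explained Variance", "path": "sklearn.metrics.explained_variance_score"},
]


def build_report(model, x_test, y_test, output_file, project_info_file,
                 title_story, title_description):
    """Compile a SmartExplainer on the test set only and write the HTML report."""
    # Imported lazily so the metric configuration can be read without Shapash.
    from shapash import SmartExplainer

    xpl = SmartExplainer(model=model)
    # Contributions are computed on the test set, matching the note above.
    xpl.compile(x=x_test, y_target=y_test)
    xpl.generate_report(
        output_file=output_file,
        project_info_file=project_info_file,  # YAML file holding the project information
        y_test=y_test,
        title_story=title_story,
        title_description=title_description,
        metrics=REPORT_METRICS,
    )
    return xpl
```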

Global feature importance plot

Features contribution plots

etage -


Model performance

Univariate analysis of target variable

prix_m2_vente - Numeric

       True values  Prediction values
count  2,783        2,783
mean   2,710        2,700
std    951          684
min    150          478
25%    2,100        2,270
50%    2,700        2,680
75%    3,250        3,110
max    8,190        6,550
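
The means agree closely, but the predictions are less dispersed than the true values: the standard deviation and the min-max range both shrink. A short sketch quantifying this, using only figures from the table (the variable names are illustrative):

```python
# Summary statistics copied from the table above.
true_stats = {"std": 951, "min": 150, "max": 8190}
pred_stats = {"std": 684, "min": 478, "max": 6550}

# Ratio of predicted to true spread; values below 1 mean compressed predictions.
std_ratio = pred_stats["std"] / true_stats["std"]
true_range = true_stats["max"] - true_stats["min"]
pred_range = pred_stats["max"] - pred_stats["min"]
range_ratio = pred_range / true_range

print(f"std ratio: {std_ratio:.2f}")      # std ratio: 0.72
print(f"range ratio: {range_ratio:.2f}")  # range ratio: 0.76
```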

Metrics

MAE : 419

R2 : 0.6

MSE : 361,000

MAPE : 0.19

MdAE : 308

Explained Variance : 0.6
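
As a reminder of what each metric measures, here is a minimal pure-Python sketch of the six quantities, following the usual definitions. The function name and the toy values are illustrative and are not data from this report. The last lines check that the reported R2 is consistent with the reported MSE and the rounded standard deviation of the true values.

```python
from statistics import mean, median, pvariance


def regression_metrics(y_true, y_pred):
    """Compute the six regression metrics listed in the report."""
    errors = [t - p for t, p in zip(y_true, y_pred)]
    abs_errors = [abs(e) for e in errors]
    mse = mean([e * e for e in errors])
    var_true = pvariance(y_true)  # population variance of the target
    return {
        "MAE": mean(abs_errors),
        "MSE": mse,
        "MdAE": median(abs_errors),
        # Relative error per row; assumes no true value is zero.
        "MAPE": mean([a / abs(t) for a, t in zip(abs_errors, y_true)]),
        "R2": 1 - mse / var_true,
        # Like R2, but ignores any constant bias in the errors.
        "Explained Variance": 1 - pvariance(errors) / var_true,
    }


# Toy example (made-up values, not from the report).
toy = regression_metrics([2000, 2500, 3000, 3500], [2100, 2400, 3300, 3500])

# Consistency check on the reported figures: R2 is roughly 1 - MSE / std^2.
reported_mse, reported_std = 361_000, 951
approx_r2 = 1 - reported_mse / reported_std ** 2
print(round(approx_r2, 2))  # 0.6
```

The check is only approximate because the reported standard deviation and MSE are rounded.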